Search CORE

96 research outputs found

Clustering by non-negative matrix factorization with independent principal component initialization

Author: Gong Liyun
K. Nandi Asoke
Publication venue
Publication date: 09/09/2013
Field of study

Non negative matrix factorization (NMF) is a dimensionality reduction and clustering method, and has been applied to many areas such as bioinformatics, face images classification, and so on. Based on the traditional NMF, researchers recently have put forward several new algorithms on the initialization area to improve its performance. In this paper, we explore the clustering performance of the NMF algorithm, with emphasis on the initialization problem. We propose an initialization method based on independent principal component analysis (IPCA) for NMF. The experiments were carried out on the four real datasets and the results showed that the IPCA-based initialization of NMF gets better clustering of the datasets compared with both random and PCA-based initializations

University of Lincoln Institutional Repository

An enhanced initialization method for non-negative matrix factorization

Author: Gong Liyun
K. Nandi Asoke
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/09/2013
Field of study

Non-negative matrix factorization (NMF) is a dimensionality reduction tool, and has been applied to many areas such as bioinformatics, face image classification, etc. However, it often converges to some local optima because of its random initial NMF factors (W and H matrices). To solve this problem, some researchers have paid much attention to the NMF initialization problem. In this paper, we first apply the k-means clustering to initialize the factor W, and then we calculate the initial factor H using four different initialization methods (three standard and one new). The experiments were carried out on the eight real datasets and the results showed that the proposed method (EIn-NMF) achieved less error and faster convergence compared with both random initialization based NMF and the three standard methods for k-means based NMF

University of Lincoln Institutional Repository

Crossref

Diffusion map for clustering fMRI spatial maps extracted by independent component analysis

Author: Alluri Vinoo
Brattico Elvira
Cong Fengyu
Nandi Asoke K.
Ristaniemi Tapani
Sipola Tuomo
Toiviainen Petri
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/09/2013
Field of study

Functional magnetic resonance imaging (fMRI) produces data about activity inside the brain, from which spatial maps can be extracted by independent component analysis (ICA). In datasets, there are n spatial maps that contain p voxels. The number of voxels is very high compared to the number of analyzed spatial maps. Clustering of the spatial maps is usually based on correlation matrices. This usually works well, although such a similarity matrix inherently can explain only a certain amount of the total variance contained in the high-dimensional data where n is relatively small but p is large. For high-dimensional space, it is reasonable to perform dimensionality reduction before clustering. In this research, we used the recently developed diffusion map for dimensionality reduction in conjunction with spectral clustering. This research revealed that the diffusion map based clustering worked as well as the more traditional methods, and produced more compact clusters when needed.Comment: 6 pages. 8 figures. Copyright (c) 2013 IEEE. Published at 2013 IEEE International Workshop on Machine Learning for Signal Processin

arXiv.org e-Print Archive

Crossref

Credit card fraud detection using AdaBoost and majority voting

Author: Lim Chee Peng
Loo Chu Kiong
Nandi Asoke K
Randhawa Kuldeep
Seera Manjeevan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

Credit card fraud is a serious problem in financial services. Billions of dollars are lost due to credit card fraud every year. There is a lack of research studies on analyzing real-world credit card data owing to confidentiality issues. In this paper, machine learning algorithms are used to detect credit card fraud. Standard models are first used. Then, hybrid methods which use AdaBoost and majority voting methods are applied. To evaluate the model efficacy, a publicly available credit card data set is used. Then, a real-world credit card data set from a financial institution is analyzed. In addition, noise is added to the data samples to further assess the robustness of the algorithms. The experimental results positively indicate that the majority voting method achieves good accuracy rates in detecting fraud cases in credit cards

Deakin Research Online

UM Digital Repository

Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection

Author: Jin Yaochu
Lei Tao
Lv Zhiyong
Min Chongdan
Nandi Asoke K.
Ning Hailong
Xu Yetong
Publication venue
Publication date: 02/06/2023
Field of study

Popular Transformer networks have been successfully applied to remote sensing (RS) image change detection (CD) identifications and achieve better results than most convolutional neural networks (CNNs), but they still suffer from two main problems. First, the computational complexity of the Transformer grows quadratically with the increase of image spatial resolution, which is unfavorable to very high-resolution (VHR) RS images. Second, these popular Transformer networks tend to ignore the importance of fine-grained features, which results in poor edge integrity and internal tightness for largely changed objects and leads to the loss of small changed objects. To address the above issues, this Letter proposes a Lightweight Structure-aware Transformer (LSAT) network for RS image CD. The proposed LSAT has two advantages. First, a Cross-dimension Interactive Self-attention (CISA) module with linear complexity is designed to replace the vanilla self-attention in visual Transformer, which effectively reduces the computational complexity while improving the feature representation ability of the proposed LSAT. Second, a Structure-aware Enhancement Module (SAEM) is designed to enhance difference features and edge detail information, which can achieve double enhancement by difference refinement and detail aggregation so as to obtain fine-grained features of bi-temporal RS images. Experimental results show that the proposed LSAT achieves significant improvement in detection accuracy and offers a better tradeoff between accuracy and computational costs than most state-of-the-art CD methods for VHR RS images

arXiv.org e-Print Archive

Combined estimation scheme for blind source separation with arbitrary source PDFs,”

Author: Asoke K Nandi
Frank Herrmann
José Millet-Roig
Vicente Zarzoso
Publication venue
Publication date: 01/01/2001
Field of study

An alternative closed-form estimator for blind source separation based on fourth-order statistics is presented. In contrast to other estimators, the new estimator works well when the source kurtosis sum is zero. Arbitrary source PDFs are successfully treated through a combined estimation scheme based on a heuristic decision rule for choosing between the new estimator and an existing estimator

CiteSeerX

Genetic algorithm based equalizer for ultra-wideband wireless communication systems

Author: Asoke K Nandi
Hai Lin
Jingbo Gao
Nazmat Surajudeen-Bakinde
Xu Zhu
Publication venue
Publication date: 01/01/2010
Field of study

CiteSeerX

Towards tunable consensus clustering for studying functional brain connectivity during affective processing

Author: Asoke K. Nandi
Basel Abu-Jamous
Brattico E.
Chao Liu
Elvira Brattico
Ghaemi R.
Haykin S.
Huettel S. A.
Luo C.
Rice J. A.
Saarikallio S.
Vul E.
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 28/12/2016
Field of study

In the past decades, neuroimaging of humans has gained a position of status within neuroscience, and data-driven approaches and functional connectivity analyses of functional magnetic resonance imaging (fMRI) data are increasingly favored to depict the complex architecture of human brains. However, the reliability of these findings is jeopardized by too many analysis methods and sometimes too few samples used, which leads to discord among researchers. We propose a tunable consensus clustering paradigm that aims at overcoming the clustering methods selection problem as well as reliability issues in neuroimaging by means of first applying several analysis methods (three in this study) on multiple datasets and then integrating the clustering results. To validate the method, we applied it to a complex fMRI experiment involving affective processing of hundreds of music clips. We found that brain structures related to visual, reward, and auditory processing have intrinsic spatial patterns of coherent neuroactivity during affective processing. The comparisons between the results obtained from our method and those from each individual clustering algorithm demonstrate that our paradigm has notable advantages over traditional single clustering algorithms in being able to evidence robust connectivity patterns even with complex neuroimaging data involving a variety of stimuli and affective evaluations of them. The consensus clustering method is implemented in the R package “UNCLES” available on http://cran.r-project.org/web/packages/UNCLES/index.html

Crossref

Archivio istituzionale della ricerca - Università di Bari

Brunel University Research Archive

SMART: Unique splitting-while-merging framework for gene clustering

Author: A Thalamuthu
AD Lanterman
AE Teschendorff
AK Jain
Asoke K. Nandi
B Abu-Jamous
B Fritzke
B Fritzke
CR Lin
CS Wallace
D Dembele
D Jiang
David J. Roberts
G Celeux
H Akaike
J Qin
J Rissanen
KY Yeung
L Hubert
L Mavridis
L Zhao
MAT Figueiredo
P Tamayo
PT Spellman
R Xu
R Xu
RJ Cho
Rui Fa
S Bandyopadhyay
S Monti
S Wu
Sergio Gómez
T Kohonen
T Pramila
TR Golub
WM Rand
YJ Zhang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 08/04/2014
Field of study

Copyright @ 2014 Fa et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.Successful clustering algorithms are highly dependent on parameter settings. The clustering performance degrades significantly unless parameters are properly set, and yet, it is difficult to set these parameters a priori. To address this issue, in this paper, we propose a unique splitting-while-merging clustering framework, named “splitting merging awareness tactics” (SMART), which does not require any a priori knowledge of either the number of clusters or even the possible range of this number. Unlike existing self-splitting algorithms, which over-cluster the dataset to a large number of clusters and then merge some similar clusters, our framework has the ability to split and merge clusters automatically during the process and produces the the most reliable clustering results, by intrinsically integrating many clustering techniques and tasks. The SMART framework is implemented with two distinct clustering paradigms in two algorithms: competitive learning and finite mixture model. Nevertheless, within the proposed SMART framework, many other algorithms can be derived for different clustering paradigms. The minimum message length algorithm is integrated into the framework as the clustering selection criterion. The usefulness of the SMART framework and its algorithms is tested in demonstration datasets and simulated gene expression datasets. Moreover, two real microarray gene expression datasets are studied using this approach. Based on the performance of many metrics, all numerical results show that SMART is superior to compared existing self-splitting algorithms and traditional algorithms. Three main properties of the proposed SMART framework are summarized as: (1) needing no parameters dependent on the respective dataset or a priori knowledge about the datasets, (2) extendible to many different applications, (3) offering superior performance compared with counterpart algorithms.National Institute for Health Researc

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Brunel University Research Archive